(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

davidhou17 · 2025-05-28T20:56:40Z

dacharyc

Everything works, but I've got a couple of nits with re-declaring stuff we've already declared, and some of the filter results. Non-blocking comments below!

dacharyc · 2025-06-27T14:58:16Z

ai-integrations/langchain-self-query-retrieval.ipynb

+    "from langchain_core.runnables import RunnablePassthrough\n",
+    "from langchain_openai import ChatOpenAI\n",
+    "\n",
+    "llm = ChatOpenAI(model=\"gpt-4o\")\n",


Nit: in the context of this notebook, we're re-declaring an llm we already declared up above in ln 233. I'd probably omit this line, and omit the related import from langchain_openai import ChatOpenAI in ln 343 above.

I also don't love re-declaring the retriever with one additional param. It would be great if we could set enable_limit when we initially declare the retriever in ln 234, and then remove the re-initializing here.

It makes sense to have these things on a docs page if we want this to be a stand-alone code example, but here in the context of the notebook, it's not needed.

Good catch!

dacharyc · 2025-06-27T15:03:16Z

ai-integrations/langchain-self-query-retrieval.ipynb

+   "id": "833d90d9",
+   "metadata": {},
+   "source": [
+    "### Queries with filters"


I got some query results that seem unrelated to the filter. i.e. for "toys", I got this document:

Document(id='685eaec1edc703d86a4c7201', metadata={'_id': '685eaec1edc703d86a4c7201', 'year': 1979, 'rating': 9.9, 'genre': 'science fiction'}, page_content='Three men walk into the Zone, three men walk out of the Zone')

For thriller and action, I got this document:

Document(id='685eaec1edc703d86a4c7203', metadata={'_id': '685eaec1edc703d86a4c7203', 'year': 1995, 'genre': 'animated', 'rating': 9.3}, page_content='Toys come alive and have a blast doing so')

I'm sure this is related to the limited amount of sample data we're providing, but it doesn't show the feature great to have these seemingly unrelated results being returned. I wonder if we want to add more sample data to show only obviously related results being retrieved?

Done! Added more data and improved the queries to produce so the outputs are more descriptive

davidhou17 force-pushed the DOCSP-50370 branch from da02802 to bd8f2f5 Compare May 28, 2025 21:10

create new notebook

a2b3560

davidhou17 force-pushed the DOCSP-50370 branch from bd8f2f5 to a2b3560 Compare June 26, 2025 17:33

davidhou17 requested a review from dacharyc June 26, 2025 17:38

dacharyc approved these changes Jun 27, 2025

View reviewed changes

davidhou17 added 3 commits July 9, 2025 10:09

review feedback

ae70be1

use voyage embeddings

7ee4fe4

update HF client

4049be4

davidhou17 force-pushed the DOCSP-50370 branch from 4dfc644 to 4a20715 Compare July 9, 2025 15:12

improve queries/data

52d7cff

davidhou17 force-pushed the DOCSP-50370 branch from 4a20715 to 52d7cff Compare July 9, 2025 15:12

use voyage embeddings

3c887e3

davidhou17 force-pushed the DOCSP-50370 branch from c79ee78 to 3c887e3 Compare July 9, 2025 15:23

davidhou17 added 2 commits July 9, 2025 11:33

add readme

a1a19b7

update field name

faf9e4f

davidhou17 merged commit c6847e3 into mongodb:main Jul 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

Uh oh!

davidhou17 commented May 28, 2025 •

edited

Loading

Uh oh!

dacharyc left a comment

Uh oh!

dacharyc Jun 27, 2025

Uh oh!

davidhou17 Jul 9, 2025

Uh oh!

dacharyc Jun 27, 2025

Uh oh!

davidhou17 Jul 9, 2025

Uh oh!

Uh oh!

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

(DOCSP-50370): Create new LangChain self-query retrieval notebook #21

Uh oh!

Conversation

davidhou17 commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dacharyc left a comment

Choose a reason for hiding this comment

Uh oh!

dacharyc Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

davidhou17 Jul 9, 2025

Choose a reason for hiding this comment

Uh oh!

dacharyc Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

davidhou17 Jul 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

davidhou17 commented May 28, 2025 •

edited

Loading